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ABSTRACT 

This paper proposes a web-based visual graph analytics platform 
for interactive graph mining, visualization, and real-time explo¬ 
ration of networks. Graph Vis is fast, intuitive, and flexible, com¬ 
bining interactive visualizations with analytic techniques to reveal 
important patterns and insights for sense making, reasoning, and 
decision making. Networks can be visualized and explored within 
seconds by simply drag-and-dropping a graph file into the web 
browser. The structure, properties, and patterns of the network are 
computed automatically and can be instantly explored in real-time. 
At the heart of Graph Vis lies a multi-level interactive network 
visualization and analytics engine that allows for real-time graph 
mining and exploration across multiple levels of granularity simul¬ 
taneously. Both the graph analytic and visualization techniques (at 
each level of granularity) are dynamic and interactive, with imme¬ 
diate and continuous visual feedback upon every user interaction 
(e.g., change of a slider for filtering). Furthermore, nodes, edges, 
and subgraphs are easily inserted, deleted or exported via a num¬ 
ber of novel techniques and tools that make it extremely easy and 
flexible for exploring, testing hypothesis, and understanding net¬ 
works in real-time over the web. A number of interactive visual 
graph analytic techniques are also proposed including interactive 
role discovery methods, community detection, as well as a num¬ 
ber of novel block models for generating graphs with community 
structure. Finally, we also highlight other key aspects including 
filtering, querying, ranking, manipulating, exporting, partitioning, 
as well as tools for dynamic network analysis and visualization, 
interactive graph generators, and a variety of multi-level network 
analysis, summarization, and statistical techniques. 


powerful statistical analysis, graph mining, and machine learning 
techniques with interactive visualization to aid in the discovery of 
important patterns and insights for sense making, reasoning, and 
decision making. 

Graph Vis is a visual analytics tool for network data, designed 
for rapid interactive visual exploration and graph mining (Figure 1). 
Starting from a simple drag-and-drop of a graph file into the browser, 
users can move from data to insights within seconds. Unlike other 
network visualization software that requires installation and up¬ 
dates, Graph Vis is web-based working directly from the browser. 
Furthermore, it is designed to be consistent with the way humans 
learn via immediate-feedback upon every user interaction (e.g., change 
of a slider for filtering) [1, 8]. Users have rapid, incremental, and 
reversible control over all graph queries with immediate and con¬ 
tinuous visual feedback. 

While Graph Vis serves as a web-based platform for visual in¬ 
teractive graph mining, it also has a number of important features 
that aid the discovery process including: 

• Drag-and-drop graph file(s) to quickly visualize and interac¬ 
tively explore networks in seconds. 

• Support for a wide variety of graph formats such as edge-lists 
(txt, csv, tsv, mtx, etc), XML-based formats (gexf, graphml), 
and a variety of others (gml, json, net/pajek). 

• Online profile and system to help manage your data, settings, 
visualizations, etc. 

• Export visualizations as high-quality images (SVG, PNG) as 
well as (transformed/filtered) graph data, attributes, etc (Fig. 2). 


Categories and Subject Descriptors 

G.2.2 [Graph theory]: Graph algorithms; H.2.8 [Database Ap¬ 
plications]: Data Mining; H.3.5 [Online Information Services]: 
Web-based services; D.4.7 [Organization and Design]: Interac¬ 
tive systems 
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1. INTRODUCTION 

Network analysis and graph mining play a prominent role in pro¬ 
viding insights and studying phenomena across various domains, 
including social, behavioral, biological, transportation, entertain¬ 
ment, and financial domains. This paper presents a web-based net¬ 
work visual analytics platform called GRAPH Vis that integrates 


• All macro and microscopic graph properties and statistics (tri¬ 
angles, kcore, etc.) are automatically updated in an efficient 



manner after each graph manipulation is performed such as in¬ 
serting and deleting nodes and links. 

• Mouseover nodes to analyze their microscopic properties (e.g., 
betweenness, PageRank, number of triangles). 

• Real-time visual graph filtering and querying capabilities in¬ 
cluding chaining multiple filters as well as exporting the result¬ 
ing data for other tasks. 

• To aid the analytic process, both nodes and edges may be col¬ 
ored and sized according to a variety of network properties (k- 
core number, eccentricity, etc) or customized by the user. 

• Subgraphs may be selected by brushing over interesting regions 
of the network visually (i.e., click-and-drag while holding shift). 
Multiple selections from different regions of the graph are also 
supported. 

• Brushed nodes, edges, and subgraphs may be deleted (by press¬ 
ing ‘D’), or manually adjusted by clicking on a selected node 
and dragging to the desired location. 

• Interactive graph generation including three newly proposed block 
model approaches that capture community structure. 

• Nodes, edges, and subgraph patterns (cliques, stars, chains, cy¬ 
cles, as well as probabilistic patterns via block models, etc) are 
also easily added with a simple click to allow for easy explo¬ 
ration, hypothesis testing (e.g., what if scenarios, as well as 
simulations to understand the impact of insertions/deletion on 
the local and global network), or simply to quickly generate a 
benchmark network for testing correctness of an algorithm. 

• Tools for dynamic network analysis and visualization. 

• Powerful graph mining and learning techniques are developed 
including interactive visual role discovery as well as advanced 
network analysis methods such as triangle-core decomposition. 

• Node information may also be updated easily via double-clicking 
the node. 

• Nearly all visualizations are interactive and support brushing, 
linking, zooming, panning, tooltips, etc. 

• Multiple visual representations of the graph data are supported, 
including the multi-level graph properties (e.g., interactive scat¬ 
ter plot matrix, and other statistical plots). 

• Network may also be searched via textual query (e.g., node 
name). 

• There are many other features including full customization of 
the visualization (color, size, opacity, background, fonts, etc), 
text annotation, graph layouts, collision detection, fish eye, and 
many others. 

2. MULTI-SCALE VISUAL ANALYTICS 

Visual analytic tools need to allow for interacting and reason¬ 
ing across multiple simultaneous scales of data representations [5]. 
Thus, we developed Graph Vis with a multi-scale visual graph 
analytics engine to support (visual) interactive network exploratory 
analysis at both the global macro-level as well as the local micro¬ 
scopic level. Visual graph mining and machine learning techniques 
lie at the heart of Graph Vis and provide the analysts with a set of 
powerful tools to discover key insights and reveal important struc¬ 
tural patterns interactively in real-time. Such an approach is vital 
for interactively exploring big data in real-time by summarizing its 
patterns, statistics (binning, distributions, etc), as well as spotting 



Figure 2: Exported SVG image from Graph Vis 

anomalies. Statistical techniques are used to find interesting nodes, 
allowing the user to sort through the top-k most interesting nodes 
for further investigation. 

Every update, insertion, or deletion of a node, edge, or subgraph 
is immediately reflected in the visualization window. Furthermore, 
the visualization and analytics are also updated immediately upon 
any parameter change via sliders or other interface controls. This 
allows to quickly test a hypothesis as well as investigate the impact 
of certain actions on the network structure and its properties/statis¬ 
tics. For instance, suppose we use betweenness to filter the graph, 
as we adjust the slider, the analyst receives visual feedback immedi¬ 
ately at each change in the slider (in contrast to adjusting the slider 
to the desired value, then receiving feedback on the selection 1 . 

2.1 Macro-level Interactive Graph Analysis 

At the macroscopic level, we use a variety of key network prop¬ 
erties. A few of these include max/avg degree, total triangles, 
global clustering, max k-core number, diameter, mean distance, ap¬ 
prox. chromatic number, number of communities/roles, and max 
triangle-core number. To help guide the interactive exploration, we 
display many of the important macro properties that help character¬ 
ize the global structure of the network in the visualization window. 

Moreover, statistical aggregates (mean, max, mode, sum, var) are 
used to summarize the global structure and behavior of the network. 

2.2 Micro-level Interactive Analysis 

To facilitate the discovery process, Graph Vis provides inter¬ 
active exploration at the microscopic level, e.g., using edge and 
node degree, eccentricity, k-core number, and triangle-core num¬ 
ber. In addition, many important social network analysis measures 
are used in the interactive visual analytics including betweenness, 

'This type of visual network analysis is also extremely useful for 
learning and education, since students can quickly grasp the behav¬ 
ior of the various statistical techniques through interactive explo¬ 
ration. 
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Figure 3: Node profile tools highlight important properties 
including microscopic and neighborhood information 


number of triangles, clustering coefficient, path lengths, PageRank, 
and many others. 

These node and edge properties are displayed in visual form and 
can be explored/manipulated directly by the user in a free-flowing 
manner (e.g., using brushing, linking, zooming, mouseover, filter¬ 
ing, etc). For instance, the neighborhood of a node can be high¬ 
lighted as well as its micro-level statistics and properties (Figure 3). 

Multiple visual representations of the graph data are also pro¬ 
vided. For example, GraphVis leverages an interactive scatter 
plot matrix for analysis of the correlation between pairs of node 
statistics (Figure 4). We also support brushing to allow users to 
highlight interesting nodes (and links) across the various measures. 
Furthermore, semantic zooming can be used to drill-down in order 
to understand the differences between individual nodes and links. 

Links may also be analyzed more closely using a similar ap¬ 
proach. A few important link measures are available including 
triangle-core numbers, number of triangles incident to an edge, as 
well as a variety of other measures. 

2.3 Graph Partitioning Methods 

GraphVis provides a diverse collection of visual interactive 
graph partitioning methods. For example, community detection, 
role discovery [7], and graph coloring. All graph partitioning meth¬ 
ods are designed to be efficient taking at most linear time in the 
number of edges to compute. 

2.4 Distributions of Measures 

Node and link summarization techniques (e.g., binning/histograms, 
statistical distributions) are used to obtain fast, meaningful and use¬ 
ful data representations. For example, we leverage binning meth¬ 
ods to interactively compute and maintain the frequency distribu¬ 
tion of some graph properties (e.g., degree distribution) upon any 
update, insertion, or deletion of a node, edge, or subgraph. Further¬ 
more, we also interactively plot the cumulative distribution func¬ 
tion (CDF) and the complementary CDF, which are easily com¬ 
puted from the frequency distribution. These are known to be im¬ 


Figure 4: Microscopic network properties may be interactively 
analyzed using scatter plot matrices. GraphVis supports in¬ 
teractive techniques such as brushing, linking, highlighting, as 
well as semantic zooming. 

portant for networks, capturing interesting structural properties (e.g., 
heavy-tailed distributions). Furthermore, we also utilize sampling [2] 
as well as fast ranking algorithms for displaying top-k nodes, links, 
and subgraphs to the user for further exploration. In addition to dis¬ 
tributions, the macro-level measures are also useful for big graph 
data and vital to the multi-level strategy offered by GraphVis. 

3. INTERACTIVE GRAPH GENERATION 

Graph generators are useful for simulations, testing algorithms, 
assumptions, benchmarks, etc [4]. The interactive graph generators 
developed in this work are broadly categorized into 

1) Model-based synthetic graph generation and visualization, us¬ 
ing standard models such as Erdos-Renyi [6], Chung-Lu (CL) [3], 
and preferential attachment (PA) [4]. 

2) Pattern-based synthetic graph generation and visualization, us- 



(a) Block PA (b) Block Chung-Lu 

Figure 5: Networks generated from the proposed Block-PA 
(BPA) and Block-CL (BCL) models. 




























ing subgraph patterns such as nodes, edges, cliques, stars, cy¬ 
cles, and chains. 

3) Hybrid synthetic graph generation and visualization that allows 
users to generate graphs using a standard model (such as Erdos- 
Renyi) in addition to adding certain patterns to the generated 
graph (e.g., cliques, and stars). 

For capturing community-structure, we proposed three additional 
block model approaches that combine multiple one-stage proba¬ 
bilistic models such as Chung-Lu, Erdos-Renyi, or preferential at¬ 
tachment by probabilistically creating inter-community edges. See 
Figure 5 for an illustration. Notably, these block-model approaches 
may also be used as patterns allowing the user to interactively ex¬ 
periment with adding various patterns to already existing networks 
in the visualization window. 



Figure 6: Screenshot of the enron email communication net¬ 
work using a window of one day. Here, we immediately see 
many of the major players involved in the Enron scandal. 


4. DYNAMIC NETWORK EXPLORATION 

Dynamic networks arise in many settings (e.g, email commu¬ 
nications). To understand the evolution and dynamic patterns in 
networks, Graph Vis provides tools to interactively analyze the 
evolution of the graph over time (Figure 6). In particular, users 
can filter temporal networks by date and time. Additionally, the 
time scale can be selected (via brushing) and adapted based on the 
application or data properties. This controls the range of time be¬ 
ing visualized. Using these tools, analysts can begin to understand 
the dynamics and trends present in the network (e.g., seasonality, 
spike, trends). Additionally, community detection, role discovery, 
and all other analytic techniques and tools may also be used for 
deeper understandings of the dynamics and network evolution. 

5. CONCLUSION 

This paper introduced a web-based interactive visual analytics 
platform for graph and network data. Our work is based on a multi¬ 
level visual analytics engine, designed for rapid interactive visual 
exploration and graph mining. Furthermore, it integrates macro and 
microscopic statistical techniques as well as graph mining and ma¬ 
chine learning with interactive visualization to aid in the discovery 
of important patterns and insights for sense making, reasoning, and 
decision making. 
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